Speaker Transformation Algorithm Using Segmental Codebooks (stasc) X
نویسنده
چکیده
This paper presents a new voice conversion algorithm which modiies the utterance of a source speaker to sound like speech from a target speaker. We refer to the method as Speaker Transformation Algorithm using Segmental Codebooks (STASC). A novel method is proposed which nds accurate alignments between source and target speaker utterances. Using the alignments, source speaker acoustic characteristics are mapped to target speaker acoustic characteristics. The acoustic parameters included in the mapping are vocal tract, excitation, intonation, energy, and duration characteristics. Informal listening tests suggest that convincing voice conversion is achieved while maintaining high speech quality. The performance of the proposed system is also evaluated on a simple Gaussian mixture model based speaker identiication system, and the results show that the transformed speech is assigned higher likelihood by the target speaker model when compared to the source speaker model. z Permission is hereby granted to publish this abstract separately.
منابع مشابه
Speaker Transformation Algorithm using Segmental Codebooks (STASC)
This paper presents a new voice conversion algorithm which modi®es the utterance of a source speaker to sound-like speech from a target speaker. We refer to the method as Speaker Transformation Algorithm using Segmental Codebooks (STASC). A novel method is proposed which ®nds accurate alignments between source and target speaker utterances. Using the alignments, source speaker acoustic characte...
متن کاملSpeaker transformation using sentence HMM based alignments and detailed prosody modification
This paper presents several improvements to our voice conversion system which we refer to as Speaker Transformation Algorithm using Segmental Codebooks (STASC)[2]. First, a new concept, sentence HMM, is introduced for the alignment of speech waveforms sharing the same text. This alignment technique allows reliable and high resolution mapping between two speech waveforms. In addition, it is obse...
متن کاملSubband Based Voice
A new voice conversion method that improves the quality of the voice conversion output at higher sampling rates is proposed. Speaker Transformation Algorithm Using Segmental Codebooks (STASC) is modified to process source and target speech spectra in different subbands. The new method ensures better conversion at sampling rates above 16KHz. Discrete Wavelet Transform (DWT) is employed for subba...
متن کاملVoice conversion by codebook mapping of line spectral frequencies and excitation spectrum
This paper presents a new scheme for developing a voice conversion system that modiies the utterance of a source speaker to sound like speech from a target speaker. We refer to the method as Speaker Transformation Algorithm using Segmen-tal Codebooks (STASC). Two new methods are described to perform the transformation of vocal tract and glottal excita-tion characteristics across speakers. In ad...
متن کاملSubband based voice conversion
A new voice conversion method that improves the quality of the voice conversion output at higher sampling rates is proposed. Speaker Transformation Algorithm Using Segmental Codebooks (STASC) is modified to process source and target speech spectra in different subbands. The new method ensures better conversion at sampling rates above 16KHz. Discrete Wavelet Transform (DWT) is employed for subba...
متن کامل